Baseline Experiments in the Extraction of Polish Valence Frames

Authors

  • Adam Przepiórkowski
  • Jakub Fast
Abstract

Initial experiments in learning valence (subcategorisation) frames of Polish verbs from a morphosyntactically annotated corpus are reported here. The learning algorithm consists of a linguistic module, responsible for very simple shallow parsing of the input text (nominal and prepositional phrase recognition) and for the identification of valence frame cues (hypotheses), and a statistical module which implements three well-known inferential statistics (likelihood ratio, t test, binomial miscue probability test). The results of the three statistics are evaluated and compared with a baseline approach of selecting frames on the basis of the relative frequencies of frame/verb co-occurrences. The results, while clearly reflecting the many deficiencies of the linguistic analysis and the inadequacy of the statistical measures employed here for a free word order language rich in ellipsis and morphosyntactic syncretisms, are nevertheless promising.
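
As a rough illustration of the kind of filtering the abstract describes, the sketch below contrasts a binomial tail test with the relative-frequency baseline. It is not the authors' implementation. It assumes that the binomial test asks how likely it is to see at least `m` cues for a frame among `n` occurrences of a verb if every cue were an error occurring with probability `p_err`. The function names, `p_err`, the significance level `alpha`, the frequency `threshold`, and the example counts are all hypothetical.

```python
from math import comb


def binomial_tail(n: int, m: int, p_err: float) -> float:
    """Return P(X >= m) for X ~ Binomial(n, p_err).

    n     -- number of occurrences of the verb (assumed count)
    m     -- number of those occurrences that carry a cue for the frame
    p_err -- assumed probability that a cue appears by mistake
    """
    return sum(
        comb(n, k) * p_err**k * (1 - p_err) ** (n - k)
        for k in range(m, n + 1)
    )


def accept_binomial(n: int, m: int, p_err: float = 0.05, alpha: float = 0.05) -> bool:
    """Accept the frame if m cues are unlikely to be errors alone."""
    return binomial_tail(n, m, p_err) < alpha


def accept_relative_frequency(n: int, m: int, threshold: float = 0.1) -> bool:
    """Baseline: accept the frame if its relative frequency reaches a cutoff."""
    return n > 0 and m / n >= threshold


if __name__ == "__main__":
    # Hypothetical counts: (verb occurrences, occurrences with the frame cue).
    for n, m in [(100, 20), (100, 5)]:
        print(n, m, accept_binomial(n, m), accept_relative_frequency(n, m))
```

With these illustrative settings, a frame seen with 20 of 100 verb occurrences is accepted by both criteria, while one seen with 5 of 100 is rejected by both. The two criteria can disagree for rare verbs, where a small count may clear a relative-frequency cutoff without being statistically distinguishable from noise.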

Similar resources

Aggregating Entries of Semantic Valence Dictionary of Polish Verbs

In this paper the phase of semantic valence dictionary of Polish verbs consisting in aggregating entries to semantically coherent sets is presented. Two methods: a simple agglomerative one and minimal spanning trees method are discussed and compared. Both methods use a predefined similarity measure of semantic frames.

Similarity measure between frames for Polish semantic valence dictionary

In this paper we define a similarity measure between semantic valence frames. It is based on a similarity measure between senses of verb arguments. Senses are represented by general semantic categories or wordnet synset. The measure is supposed to be used for aggregation of semantic valence frames that differ in NP/PP slot senses but represent the same meaning of the verb.

Towards the Automatic Acquisition of a Valence Dictionary for Polish

This article presents the evaluation of a valence dictionary for Polish produced with the help of shallow parsing techniques and compares those results to earlier results involving deep parsing. We show that the valence dictionary obtained with the use of shallow parsing attains higher quality when it is measured on the basis of a corpus of valence frames, while the dictionary produced with the...

Effect of Nail Polish on Pulse Oximetry Findings in Healthy Volunteers: A Randomized Clinical Trial

Background and Objectives: Pulse oximetry is the most common technique for monitoring hemoglobin oxygen saturation (SpO2). Different colors and brands of nail polish may cause disturbance in the reading and interpretation of oxygen saturation. The aim of this study was to determine the effect of different colors of nail polish on oxygen saturation measured by pulse oximeter. Methods: Thirty he...

Valence extraction using EM selection and co-occurrence matrices

This paper discusses two new procedures for extracting verb valences from raw texts, with an application to the Polish language. The first novel technique, the EM selection algorithm, performs unsupervised disambiguation of valence frame forests, obtained by applying a non-probabilistic deep grammar parser and some post-processing to the text. The second new idea concerns filtering of incorrect...

Publication date: 2005